Automatic sentential vowel stress labelling

Author

  • James Hieronymus
Abstract

There is general agreement that sentential syllable vowel stress (called prominence by some authors) in American English is marked by pitch rise-falls, energy, and duration. None of these cues is sufficient by itself; instead, talkers use combinations of them to signal stress in continuous speech. After studying the stress marking strategies of 15 talkers of American English, an algorithm was devised which labels vowels with three levels of stress. The algorithm is based on combinations of pitch rise-falls, relative energy, and duration. The pitch is determined automatically in all voiced regions in the sentence. Then the regions are characterised as having rising pitch, falling pitch, or steady pitch. Sequences of three regions are examined to find the pitch rise-fall patterns which signal stress. The energy in the band 0-2500 Hz is determined throughout the utterance. All the energy measurements are made relative to the maximum energy in the sentence. If the energy of the vowel is within 11 dB of the maximum, it is considered energy stressed. The duration is determined from hand labels in the present implementation. Duration is corrected for prepausal effects. If two out of three cues are present, then the vowel is labelled stressed. If the vowel has the highest energy, longest duration, and highest pitch, then it is labelled as highly stressed. If the vowel has very low energy relative to the loudest sound in the sentence, then it is labelled unstressed no matter what the other two cues indicate. The algorithm was tested on 125 sentences of American English and found to perform very well. Pitch stress was the most difficult cue to determine. Detailed analysis of the results shows that approximately 85% of the syllables are correctly stress labelled.
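The decision rules in the abstract can be illustrated with a minimal Python sketch. It is not the author's implementation. Only the 11 dB energy criterion, the two-out-of-three rule, the "highly stressed" rule, and the low-energy override come from the abstract. The names `Vowel`, `classify_region`, `has_rise_fall`, and `label_vowels` are hypothetical, as are the slope threshold, the 25 dB "very low energy" floor, and the particular rise-fall patterns. The pitch and duration cues are passed in as booleans because the abstract does not say how they are thresholded.

```python
from dataclasses import dataclass
from typing import List

ENERGY_STRESS_DB = 11.0    # from the abstract: within 11 dB of the sentence maximum
VERY_LOW_ENERGY_DB = 25.0  # assumption: the abstract gives no value for "very low"
SLOPE_HZ_PER_S = 20.0      # assumption: minimum slope for a rising or falling region


@dataclass
class Vowel:
    energy_db: float      # energy in the 0-2500 Hz band, in dB
    duration_s: float     # duration, already corrected for prepausal effects
    peak_pitch_hz: float  # highest pitch within the vowel
    pitch_cue: bool       # True if a stress-signalling rise-fall was found
    duration_cue: bool    # True if the duration counts as stressed (criterion assumed)


def classify_region(start_hz: float, end_hz: float, duration_s: float) -> str:
    """Characterise a voiced region as rising, falling, or steady pitch."""
    slope = (end_hz - start_hz) / duration_s
    if slope > SLOPE_HZ_PER_S:
        return "rise"
    if slope < -SLOPE_HZ_PER_S:
        return "fall"
    return "steady"


def has_rise_fall(regions: List[str]) -> bool:
    """Examine sequences of three regions for a rise-fall pattern.

    Assumption: a rise directly followed by a fall, or rise-steady-fall,
    signals stress. The abstract does not list the actual patterns.
    """
    for i in range(len(regions) - 2):
        a, b, c = regions[i:i + 3]
        if (a, b) == ("rise", "fall") or (b, c) == ("rise", "fall"):
            return True
        if (a, b, c) == ("rise", "steady", "fall"):
            return True
    return False


def label_vowels(vowels: List[Vowel]) -> List[str]:
    """Assign one of three stress levels to each vowel in a sentence."""
    max_energy = max(v.energy_db for v in vowels)
    max_duration = max(v.duration_s for v in vowels)
    max_pitch = max(v.peak_pitch_hz for v in vowels)
    labels = []
    for v in vowels:
        drop_db = max_energy - v.energy_db
        if drop_db > VERY_LOW_ENERGY_DB:
            # Very low energy overrides the other two cues.
            labels.append("unstressed")
        elif (v.energy_db == max_energy and v.duration_s == max_duration
              and v.peak_pitch_hz == max_pitch):
            labels.append("highly stressed")
        else:
            cues = [drop_db <= ENERGY_STRESS_DB, v.pitch_cue, v.duration_cue]
            labels.append("stressed" if sum(cues) >= 2 else "unstressed")
    return labels
```

In this sketch the low-energy check runs first, so that it overrides the other cues as the abstract requires. All energies are compared with the sentence maximum in dB, which matches the relative measurement the abstract describes.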

Similar resources

Labelling improves false belief understanding. A training study.

A total of 104 children aged between 41 and 47 months were selected to study the relationship between language and false belief understanding. Participants were assigned to four different training conditions: discourse, labelling, control (all with deceptive objects), and sentential complements (involving non-deceptive objects). Post-test results showed an improvement in children's false belief...

Production of English Lexical Stress by Persian EFL Learners

This study examines the phonetic properties of lexical stress in English produced by Persian speakers learning English as a foreign language. The four most reliable phonetic correlates of English lexical stress, namely fundamental frequency, duration, intensity, and vowel quality were measured across Persian speakers’ production of the stressed and unstressed syllables of five English disyllabi...

Automatic Prosody Labelling of read Norwegian

In this paper we present initial work on a method for automatic stress and boundary labelling of read East Norwegian. The context of this work is automatic corpus annotation for unit selection speech synthesis. A phonological model of Norwegian prosody is described. The identification of syllable stress and major intonational boundaries are key prosodic events for building a prosodic description...

Using automatic stress extraction from audio for improved prosody modelling in speech synthesis

Generating proper and natural sounding prosody is one of the key interests of today’s speech synthesis research. An important factor in this effort is the availability of a precisely labelled speech corpus with adequate prosodic stress marking. Obtaining such a labelling constitutes a huge effort, whereas interannotator agreement scores are usually found far below 100%. Stress marking based on ...

Duration as the Main Correlate of Lexical Stress in Italian

In a free-stress language like Italian, automatic detection of stressed syllables is of great importance for access to the lexicon. In continuous speech recognition systems this information can facilitate lexical access since it constrains the location of word boundaries and can reduce the process of word hypothesizing and matching. The paper summarizes the results of acoustic and perceptual st...

Publication date: 1989